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PB39, A GENE DYSREGULATED IN PROSTATE CANCER, 



AND USES THEREOF 



FIELD OF THE INVENTION 

This invention relates to a novel gene, PB39, and variants thereof, which is dys- 
regulated in prostate cancer. This invention also relates to polypeptides encoded by these 
genes, antibodies to the polypeptides, and to methods for detection of PB39, variants thereof, 
and/or its gene product. These methods can be used to assess the presence of precancerous or 
cancerous cells in the prostate gland. 



BACKGROUND OF THE INVENTION 



Prostate cancer is the most common malignancy in men and the second leading cause 
of cancer mortality in the United States (Wingo P. et al., Cancer statistics, Cancer J Clin 45: 
8-30 (1995)). The disease can progress with markedly different clinical outcomes. An 
understanding of the difference between aggressive and nonaggressive tumors at the 
molecular level has been hindered by the diverse cell types present in the prostate gland and 
an inability to derive pure cell populations for genetic study. 

The Gleason score is a reliable prognosticator for disease progression at both ends of 
the histologic spectrum (<4 or Poor disease outcome in the more prevalent Gleason 
range (5 to 7) is best predicted by a positive surgical margin, capsular penetration, and 
seminal vesicle invasion (Epstein J. et al, Prediction of progression following radical 
prostatectomy: a multivariate analysis of 721 men with long-term follow-up, Am J Surg 
Pathol 20: 286-292 (1996)). However, these parameters can only be assessed after major 
surgery. Certain cases of prostate cancer respond to surgical and/or pharmaceutical 
intervention. Others metastasize rapidly, by mechanisms that remain poorly understood, to 
tissues such as bone. 

Currently available diagnostic methods directed to the detection of prostate cancer 
focus largely on prostate specific antigen (PSA). This protein is known to be a glycosylated 
serine protease, and is produced in relatively large quantities in the epithelial cells of the 
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prostate and is secreted in seminal fluid. Smaller amounts are detected in the serum of 
healthy individuals; higher serum concentrations are thought to be correlated with 
pathological conditions such as prostate cancer. Other diagnostic methods address nucleic 
acids differentially expressed in prostate cancer. 
5 U. S. Patent 5,674,682 issued Oct. 7, 1997, to Croce et al. relates to methods for 

detecting prostate cancer micrometastasis, in which a sample containing nucleic acids is 
amplified and probed by hybridization to particular oligonucleotide probes. These probes are 
disclosed as being specific for prostate cancer. 

U. S. Patent 5,658,730 issued Aug. 1 9. 1 997, to McGill et al., entitled "Methods of 
10 Human Prostate Cancer Diagnosis", discloses diagnostic techniques for the detection of 
^ human prostate cancer. A set of degenerate probes is used to detect gene amplification in 
%j prostate cancer cells at regions of chromosome 8q24. 1-24.2. A comparison of the probe 

• % sequence with the specific primers used reveals no significant segments sharing identity 

■M between either of the specific primers and the disclosed probe. Copy number changes of 

i'X! 

'l J! 5 chromosome 8q serve as a marker for the development of aggressive prostate cancers. 

U. S. Patent 5,622,829 issued Apr. 22, 1997, to King et al., entitled "Genetic Markers 
for Breast, Ovarian and Prostatic Cancer", discloses the nucleotide sequences for several 
m alleles of B RCA L The specification suggests ascertaining men at risk for prostatic cancer in 
view of female siblings or family members diagnosed for breast cancer. Several alleles of 
20 BRCA1 are disclosed. A method of screening a patient for prostatic cancer susceptibility 

based on hybridizing with nucleic acids comprising the sequences provided is also claimed. 

U. S. Patent 5,614,372 issued March 25, 1 997, to Lilja et al. relates to a bioaffinity 
assay of PSA using monoclonal antibodies in which a measure of PSA is related to the total 
of the concentration of PSA plus human glandular kallikrein-l present in a sample of body 
25 fluid. The PSA level determined may be either the concentration of free PSA (i.e., 

uncomplexed PSA) or the concentration of PSA complexed with alpha-1 anti-chymotrypsin. 
The assay results permit discrimination between benign prostatic hyperplasia and prostate 
cancer. 

U. S. Patent 5,552,277 issued Sep. 3, 1996, to Nelson et al., entitled "Genetic 
30 Diagnosis of Prostate Cancer", teaches that prostatic glutathione-S-transferase promoter 
becomes hypermethylated in most prostatic cancers. Nelson et al. discloses a method of 
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detecting the hypermethylated promoter in view of its altered susceptibility to a methylation- 
sensitive restriction nuclease. 

U. S. Patent 5,543,296 issued Aug. 6, 1996, to Sobol et aL entitled "Detection of 
Carcinoma Metastases by Nucleic Acid Amplification", discloses a method for detecting 
5 metastasis of a prostate carcinoma which entails treating a sample of non-prostate tissue in 
such a way as to amplify mRNA for PSA. A method for detecting carcinoma metastases in 
body tissues and fluids is disclosed only in general, broad terms; prostatic acid phosphate [sic, 
phosphatase] and PSA are mentioned only at col. 12, 1. 23-26. Several primers are disclosed. 
U. S. Patent 5,506,106 issued Apr. 9, 1996, to Croce et aL entitled "Methods of 
10 Detecting Micrometastasis of Prostate Cancer", discloses a procedure for diagnosing prostate 
cancer metastasis by seeking mRNA from PSA among the population of nucleated cells in a 
blood sample, using RT-PCR with PSA-specific primers. 

11 S. Patent 5,501,983 issued Mar. 26, 1996 to Lilja et aL, entitled "Assay of Free and 
Complexed Prostate-Specific Antigen", discloses methods of immunoassay for measuring 
15 PSA both free and as a proteinase inhibitor complex. 

In spite of these advances, there remains a need to develop prognostic tumor markers 
that can be measured in limited needle biopsies early in the progression of prostate cancer, 
especially in the case of aggressive prostate cancer. There further remains a need to address 
more selective and specific assays for various types of prostate cancer. Assays are needed 
20 that permit distinguishing prostate cancer from non-neoplastic prostate disease. Furthermore 
there is a need to develop diagnostic methods that facilitate identifying prostate cancers of 
differing aggressiveness and metastatic potential. There is likewise a need for the diagnostic 
probes on which such methods are founded. This invention addresses these needs. 

25 SUMMARY OF THE INVENTION 

The invention relates to a novel gene, PB39, that is up-regulated, or over-expressed, in 
prostate cancer. The gene has been identified by means of its cDNA obtained by reverse 
transcription of the corresponding mRNA. Microdissection of prostate glands that had been 
30 surgically removed from prostate cancer patients revealed a novel up-regulated transcript in 
an aggressive prostate carcinoma. Differential analysis for the presence of this gene was 
carried out from the same glands by comparing transcription in microdissected normal 
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prostatic epithelium versus that in microdissected invasive tumor. The transcript was over- 
expressed in 5 of 10 prostate carcinomas examined. A variant transcript was over-expressed 
in 4 of 4 prostate carcinomas, and was found in only 1 of 4 normal samples. 

The invention provides a purified and isolated nucleic acid that includes the sequence 

5 of PB39 given in SEQ ID NO:l or its complement, and the sequence of the PB39 variant 
given in SEQ ID NO:3 or its complement. In particular embodiments, the nucleic acid may 
be an RNA or a cDNA. Additionally, the invention provides a purified and isolated nucleic 
acid, such as a primer or probe, that includes a sequence that is a fragment of the PB39 
sequence or its complement, or a fragment of the PB39 variant or its complement. Examples 

)0 of such primers are given in SEQ ID NO:7, SEQ ID NO;8, SEQ ID NO;10, SEQ ID NO:l 1, 
and SEQ ID NO:12. 

The invention additionally provides polypeptides, whose sequences are given in SEQ 
ID NOs:2 and 4. that are encoded by nucleic acids that include either the sequence for PB39 
or the sequence for the PB39 variant. In one embodiment, this polypeptide is a recombinantly 

1 5 produced polypeptide. The invention furthermore provides an antibody that binds 
immunospecificaily with PB39 or PB39 variants. 

A method of detecting precancerous cells or cancer cells in the prostate of a subject is 
also provided in this invention. The method includes providing a sample of tissue or fluid 
from the subject and determining whether the sample contains an abnormally high content of 

20 a nucleic acid that includes the sequence of PB39 given in SEQ ID NO: 1 or its complement, 
or the sequence of the PB39 variant given in SEQ ID NO:3 or its complement, or a fragment 
of these sequences. A finding that the sample contains an abnormally high content of the 
nucleic acid indicates that the subject has precancerous cells or cancer cells in the prostate. In 
an important embodiment of this method, the determining step includes amplifying the 

25 nucleic acid and detecting the amplified nucleic acid. 

Additionally the invention provides a method of detecting precancerous or cancer 
cells in the prostate of a subject. This method includes providing a sample of tissue or fluid 
from the subject and determining whether the sample contains an abnormally high content of 
a polypeptide that is the gene product of the PB39 gene or the variant PB39 gene. Finding 

30 that the sample contains an abnormally high content of the polypeptide indicates that the 

subject has precancerous or cancer cells in the prostate. In an important embodiment of this 
method, the determining step further includes contacting at least a portion of the sample with 
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an antibody that binds immunospecifically with the polypeptide and determining the amount 
of the antibody that has bound with the polypeptide present in the sample. In particular 
embodiments of both of these methods, the sample may be a body fluid, or may be tissue 
originating from the prostate. 

BRIEF DESCRIPTION OF THE DRAWING 



Figure 1 . Panel a. Denaturing electrophoresis gel comparing reverse transcription- 
polymerase chain reaction (RT-PCR) amplification products produced by arbitrary and zinc 

10 finger primers. Five tumor samples (T) and two normal samples (N) from NCI Patient 1542 
are shown. Two bands increased in the tumor samples are present (arrow and arrowhead). 
R00504 is represented by the arrow. Panel b. Denaturing electrophoresis gels showing 
R00504 overexpression in two cases (Table 1, cases 1 and 2). Left: RT-PCR amplification of 
R00504 in normal (N) and tumor (T) samples. Right: Amplification of beta-actin from the 

1 5 same samples. +, reverse transcriptase reaction with Moloney Murine Leukemia Virus 
(MMLV); MMLV reverse transcriptase was replaced by water; C, positive control. 

Figure 2. Northern blot of fetal tissue samples using the specific probe R00504. 
Lanes contain RNA as follows: lane 1 , kidney; lane 2, liver; lane 3, lung; and lane 4, brain. 
Standards are shown on the right. 

20 Figure 3. Panel A. Clontech adult tissue Northern blots probed with radiolabeled 

R00504 insert. The exposure times of the blots on the left and right were 40 hrs and 6 hrs, 
respectively. Panel B. Clontech fetal tissue Northern blot probed with radiolabelled R00504 
insert. The exposure time was 40 hrs. 

Figure 4. Panels A and B. Nucleotide and amino acid sequence of PB39. The 

25 nucleotide sequence is numbered on the left and the amino acid sequence numbered on the 

right. The underlined ATG start is at nucleotide position 77. For the two sequences beginning 
at nucleotide position 1613, the upper nucleotide and amino acid sequences refer to the 2.3 kb 
transcript. The lower nucleotide and amino acids sequences refer to the 5 kb transcript. Panel 
C. Sequence overlap and divergence between 2.3 kb and 5 kb transcripts (upper and lower, 

30 respectively). Open reading frame (between arrowheads), 5' untranslated region (UTR) 

(vertical fine pattern), 3' UTR (area downstream of arrowhead), inserted sequence of the 5kb 
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transcript (black), position of divergence (arrow). The white area corresponds to the same 
sequence in both transcripts. Representative EST clones are identified below each transcript. 

Figure 5. Clontech fetal tissue Northern blot probed with radiolabeled 5 kb transcript 
specific probe. The exposure time was 5 days. The amount of RNA loaded on all of the gels 
5 was adjusted to give similar beta-actin hybridization signals and represents approximately 2 
jLtg of polyA-selected mRNA. 

Figure 6. Regional localization of PB39. Panel A. Localization to chromosome 1 Ip. 
4 t 6'-Diamino 2-phenyl-indole (DAPI ) counterstained metaphase showing the location of the 
PB39 gene (red, at arrows) on the short arm of chromosome 1 1 . Panel B. "G-banded" 
10 chromosomal analysis. An example of the DAPI image of Panel A showing chromosome 1 1 
which was converted to a black and white "G-band" image to show the position of PB39 on 
the short arm of chromosome 1 1 close to the centromere. The position of PB39 is identified at 
1 Ipl 1 .1 -1 1 .2. Panel C. DAPI and simulated G-banded image of chromosome 1 1 after 
fluorescence in situ hybridization (FISH). Arrows show location of PB39. Panel D. 
1 5 Ideogram of chromosome 1 1 . 

Figure 7. Western blots of PB39 probed with polyclonal antibodies raised against 
peptides 646 (Panel A), 644 (Panel B) and 656 (Panel C). 



DETAILED DESCRIPTION OF THE INVENTION 

20 

The gene designated PB39 identified in the present invention includes a nucleotide 
sequence previously found in expressed sequence tag (EST) R00504 (GenBank). The 
procedure that led to its discovery and identification depends on an analysis of differential 
transcription of the gene in neoplastic tissue from a prostate gland that is cancerous compared 

25 to normal tissue from the same gland. Specifically, microdissection of epithelial tissue from 
prostate glands that had been surgically removed from prostate cancer patients was carried 
out. PB39 in microdissected invasive tumor compared to that in microdissected normal 
prostatic epithelium was found to be over-expressed in 5 of 10 prostate carcinomas examined. 
These findings identify the condition detected as prostatic intraepithelial neoplasia, the 

30 earliest precursor of prostate cancer. Reliable evaluation of overexpression was aided by 
comparing transcription to that of the constitutively expressed gene for beta-actin. 
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The finding that PB39 is over-expressed in prostatic intraepithelial neoplasia indicates 
that the gene and its gene product are useful in the early diagnosis of this disease, and that 
they may serve as a marker for its early appearance. Methods that may be employed for this 
purpose include, without limiting the scope of the invention, assay for the gene by the 
5 polymerase chain reaction and in situ hybridization, and analysis for the protein product by 
immunoassay, or by immunohistochemical detection such as indirect immunofluorescence. 
Probes and antibody reagents may be developed to permit imaging prostate cancer, both 
primary and metastatic. Isolation of the PB39 gene and gene product will contribute to the 
understanding of prostate cancer development and progression, based on experimental studies 
10 using methods such as immunohistochemistry or in situ hybridization. The availability of 
^ PB39 will additionally permit the development of methods of treatment of subjects 

; L t determined to have prostate cancer. Specifically, treatment modalities such as chemotherapy, 

Jp immunotherapy, or antisense nucleotides, for example, may be developed to target the 

■5 prostate cancer identified by PB39. The gene may be applied to produce the recombinant 

\ Hi 5 gene product in appropriate host cells, especially in mammalian cells. The recombinant gene 

product then may serve as the immunogen to provide antibodies to be applied in the various 
III methods mentioned above. 

Initial identification of PB39 has been made as a result of experiments performed on 
Q surgically excised cancerous prostate glands. The glands were frozen in liquid nitrogen as 

' " :f 20 rapidly as possible in order to preserve all metabolically unstable species that may be present 
as well as possible without breakdown. Particular species of interest in the present invention 
are actively transcribed mRNA species leading to expression of genes that may be 
characteristic of the cancerous state. RNA was extracted from microdissected invasive tumor 
cells and from corresponding normal prostatic epithelium from the same gland, and the gene 
25 expression profile was examined in the two sets of cells as they exist in vivo. 

Microdissection was carried out on sections of the prostate under low power 
microscopic magnification with sufficient resolution and care to isolate only the types of 
tissue desired (Chuaqui, R. et ah,. Identification of a Novel Transcript Up-Regulated in a 
Clinically Aggressive Prostate Carcinoma, Urology 50: 302-307 (1997), incorporated herein 
30 by reference). In addition to the microscopic field visualized, guidance was further obtained 
from adjacent sections that have been treated and stained to enhance the identification of 
histological features. The dissection procedure was carried out as rapidly as possible in order 
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to minimize degradation of RNA species by endogenous processes. The microdissection 
technique is further characterized in Emmert-Buck et al. (Increased gelatinase A and 
cathepsin B activity in invasive tumor regions of human colon cancer samples, Am J Pathol 
145:1285-1290 (1994)) and Zhuang et al. (A microdissection technique for archival DNA 
5 analysis of specific cell populations in lesions <1 mm in size, Am J Pathol 146:620-625 
(1995)) both of which are incorporated herein by reference. 

Each sample of microdissected tissue is then extracted with a procedure, such as the 
guanidinium thiocyanate-phenol-chloroform method (Chomczynski et al., Single-step of 
RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction, Anal 
10 Biochem 162: 156-159 (1987)), in order to provide an RNA sample that may be assessed for 
the presence of genes of interest. The samples should contain sufficient numbers of cells to 
< ; y provide RNA for the subsequent manipulations. Commonly it is expected that at least about 

; £ 5,000 cells, and preferably at least about 10,000 cells will be treated for a given sample. If 

necessary, further purification of the isolated RNA may be carried out. This may include, 
PLf5 without intending to limit the purification methods employed, steps such as digestion with 
deoxyribonuclease, and further purification steps known to workers of skill in cell biology, 
| s f molecular biology, and cancer research that permit one to isolate mRNA that is actively 
M involved in translation of genes to yield protein products. Such methods are set forth in 

general in "Current Protocols in Molecular Biology", Ausubel et al., John Wiley and Sons, 
' 5£o New York 1987 (updated quarterly), and "Molecular Cloning: A Laboratory Manual", 2nd 
Ed., Sambrook, Fritsch and Maniatis, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY, 1989, which are incorporated herein by reference. 

The purified mRNA sample is then reverse transcribed to yield the corresponding 
cDNA species using procedures well known to workers of skill in cell biology, molecular 
25 biology, and cancer research (Ausubel et al., Sambrook et al.). The cDNA sample is then 
further analyzed for species that contain sequence motifs known to bind DNA, such as zinc 
finger motifs. This is carried out by using PCR with primers designed to amplify the motifs 
sought. In particular, the present invention discloses that one such primer is an arbitrary 
primer developed by Stratagene (La Jolla, CA), primer A2 (AATCTAGAGCTCCAGCAG 
30 (SEQ ID NO:5)), and a zinc finger-directed primer (Zinc 2, 

GTCGTCGAATTCCACACAGGAGAAAAGCC (SEQ ID NO:6; Stone et al., Targeted 
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RNA fingerprinting: the cloning of differentially-expressed cDNA fragments enriched for 
members of the zinc finger gene family, Nucl Acids Res 22: 2612-2618 (1994)). 

This PCR generally provides an amplified DNA sample enriched in DNA molecules 
encoding gene products having zinc finger motifs. The present inventors discovered 
unexpectedly that differential display gels of PCR products obtained using this primer pair 
yielded a particular band having pronounced expression in prostatic cancerous epithelium but 
not in normal epithelium. Upon partial sequencing of this DNA, it was found that a portion 
of the sequence matched a sequence appearing in the EST database, GenBank accession no. 
R00504, 

Primers specific for R00504 were designed. These are 
GCATGTTACAGGTAGAAAAGCC (SEQ ID NO:7) and 
CTGGCGTATCTGAAGAGTCTG (SEQ ID NO:8). 

These primers may be employed for specific binding to and amplification of sequences 
contained within R00504, and as such are useful in probing the gene overexpressed in 
cancerous epithelium. When these R00504-specific probes were labeled and employed in 
Northern blot analysis of mRNA species from prostatic epithelium samples, a molecule of 
approximately 2.6 kb was identified. The gene defined in this mRNA transcript is termed 
PB39. The complete nucleotide sequence for PB39 has been obtained and is provided in 
SEQ ID NO: 1 . The sequence of the protein encoded by PB39 is provided in SEQ ID NO:2. 
The details of the procedures used to obtain this result are provided in Example 6. 

It was subsequently determined that a variant of PB39 results from an alternative 
RNA splicing mechanism during maturation of the RNA transcript. This variant was 
analyzed by PCR, including using a probe specific for the sequence inserted into the long 
form. Sequencing of the resulting amplified DNA showed the variant to have a different 
sequence, and a termination codon yielding a translated gene product one amino acid residue 
longer than the 2.6 kb form. The nucleotide sequence of the variant PB39 up to its 
termination codon is provided in SEQ ID NO:3, and the sequence of the protein encoded by 
this variant is given in SEQ ID NO:4. The details of these procedures and of the results 
obtained are provided in Example 7. 

The transcripts corresponding to SEQ ID NOs: 1 and 3 are found to occur in 
abnormally high concentrations in samples derived from cells of prostate cancer epithelium 
when referred to the level found in cells from normal epithelium from the same prostate 
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gland. This observation forms the basis of a method of detecting precancerous cells or cancer 
cells in the prostate of a subject. The level of PB39 transcript is determined, for example, by 
using reverse transcription-PCR, Northern blot analysis, or comparable methods known to 
workers of skill in cell biology, molecular biology, and cancer research. In these methods, 
the PCR may be carried with any primer pair specific for PB39. including but not limited to, 
primers that contain the sequences of SEQ ID NOs;7, 8, 10, 11, and 12. The probes to be 
applied in Northern blot analysis are to be labeled, and may be based on the primers 
including the sequences of SEQ ID NOs:7 s 8, 10, 1 1, and 12 mentioned above, as well as on 
labeled forms of nucleic acids containing the PB39 sequences given by SEQ ID NOs:l and 3 
or their complements. The probes for Northern analysis may also employ fragments of the 
PB39 sequences given by SEQ ID NOs:l and 3 or their complements, with the proviso that 
such fragments are specific for PB39 and are long enough to hybridize effectively with the 
target sequence in the sample being probed. In each case, the levels of the PB39 transcripts 
are normalized by taking the ratio of the level of the transcript found to the level of the 
transcript for a constitutive gene present in the same cells. An example of a gene for a 
constitutively expressed transcript is that for beta-actin. The present invention also provides 
diagnostic kits including the above novel nucleic acids, primers and probes for purposes of 
carrying out this method of detection. 

The method of detecting precancerous or cancer cells determines whether the level of 
PB39 transcript is present at an abnormally high level. An "abnormally high' 1 level, or 
content, of the nucleic acid transcript, as used herein, relates to a ratio of the level of PB39 
transcript to the level of beta-actin transcript that is preferably about two times or more 
higher in the sample of the cancerous epithelium than that found in a set of samples taken 
from normal epithelium, as expressed by a mean value found therein. 

In performing this method, the sample from the subject may be a biopsy sample 
drawn from the prostate gland of the subject. In favorable cases, such a biopsy may be 
obtained in a procedure that minimizes invasiveness and discomfort to the subject, such as a 
needle biopsy. Alternative samples may be a body fluid from the subject, including but not 
limited to, blood, urine, and seminal fluid. Generally, sampling methods and choices are well 
known to workers of skill in the art such as urologists and oncologists. 

The gene products encoded by the nucleotide sequences of SEQ ID NOs:l and 3 are 
provided in SEQ ID NOs:2 and 4 5 respectively. The proteins incorporating these amino acid 
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sequences may be produced as recombinant proteins in host cells modified by vectors 
containing nucleic acid sequences, such as the sequences of SEQ ID NOs:l and 3, that 
encode the proteins. Such recombinant proteins may be produced in prokaryotes such as 
Escherichia colL Preferably, however, eukaryotic hosts will be employed. For example, host 
5 cells from members of many families of Lepidoptera, such as SF-9 cells, may be employed. 
Such host cells are modified to produce the desired protein by infection with a recombinant 
baculovirus. Autographa California, wherein the recombinant baculovirus carries the gene for 
the heterologous protein, commonly under the control of the promoter for the gene for the 
polyhedrin protein of the virus. 
10 Alternatively, various mammalian cells may be used to produce the PB39 protein 

ih product upon being transfected with an appropriate vector harboring the gene including the 

f = sequence of SEQ ID NO: 1 or SEQ ID NO;3. Methods for preparing the vectors and 

W producing the recombinant proteins are set forth broadly in general terms in Ausubel et al. 

fy and Sambrook et al, for example. These procedures are well known to workers of skill in the 

y 15 fields of cell biology and molecular biology. The PB39 protein is to be produced as a 
C| recombinant protein in a system such as summarized herein, or an equivalent system, and 
I^l purified to a high degree of purity. Procedures that may be employed in the purification 

jfj include fractional precipitation, chromatography, centrifugation, and the like. Such 

f|| procedures are well known to workers of skill in protein chemistry and enzymology and are, 

20 for example, set forth in Deutscher, M. P- (ed.), Guide to Protein Purification: Methods in 
Enzymology. Vol. 182, Academic Press, San Diego, CA, 1990; and Scopes, R. K., Protein 
Purification: Principles and Practice, 3rd Ed., Springer-Verlag, New York, NY, 1993. 

The purified PB39 protein corresponding to SEQ ID NO:2 or SEQ ID NO:4 may be 
used as the immunogen to prepare antibodies against the respective proteins. The antibody 
25 may be a polyclonal antibody, obtained upon immunizing a host such as a rabbit with the 

PB39 immunogen. Alternatively, one or more monoclonal antibodies may be obtained upon 
immunizing a host such as a mouse with the PB39 immunogen, and preparing hybridomas 
that secrete the antibody. Successful hybridomas are obtained as a result of probing 
hybridoma clones for antibody molecules that bind immunospecifically with the PB39 
30 immunogen. These procedures are well known to workers of skill in the field of molecular 
immunology, and are set forth in general terms in Harlow and Lane, "Antibodies: A 
Laboratory Manual", Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1988), and 
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Coligaru J.. A. Kruisbeek, D. Margulies. E. Sevach, and W. Strober, "Current protocols in 
immunology", John Wiley & Sons. New York (1994), which are incorporated herein by 
reference. 

Antibodies produced according to the procedures described above are useful in 
immunoassays directed to detecting precancerous cells or cancer ceils in the prostate of a 
subject. In such immunoassays, for example in an enzyme linked immunosorbent assay 
(ELISA), the concentration of PB39 antigen is detected in a sample obtained from the 
subject. If the sample is a fluid sample, it may be used as is, or treated to remove cellular 
components, and furthermore may be volumetrically diluted if necessary to attain an 
appropriate concentration of the antigen. If the sample is a sample of cells from the prostate 
gland the sample may be homogenized and the fraction containing the PB39 antigen may be 
concentrated. If desired, the prostate sample may be microdissected first to provide prostate 
epithelium. In an ELISA, by way of nonlimiting example, a first antibody that binds 
immunospecifically with PB39 is immobilized on the surface of a suitable assay vessel, and 
the remaining surface sites are then blocked with an innocuous protein. The sample 
suspected of containing PB39 antigen is added and allowed to react with the immobilized 
first antibody. After thorough rinsing, any immobilized antigen is further treated with a 
second antibody that binds immunospecifically with PB39, and the second antibody is 
detected. These procedures are well known to persons of skill in the fields of molecular 
immunology and diagnostic immunochemistry. 

The method of detecting precancerous cells or cancer cells determines whether the 
level of PB39 antigen is present at an abnormally high level. An "abnormally high" level, or 
content, of the antigen, as used herein, relates to the level of PB39 antigen that is, preferably, 
about two times or more higher than that found in a set of samples taken from normal 
subjects, as expressed by a mean value found therein. These levels or contents may vary, 
depending on the origin of the particular sample taken. In one embodiment, the level or 
content is determined as an absolute number representing the concentration or amount of the 
antigen present in the sample taken. In another embodiment, the level or content of PB39 
antigen may be related by ratio to the level or content of a second antigen known to be 
constitutively present in the particular sample used for the assay. In this embodiment, the 
value of the ratio of the level of PB39 antigen to the level of the constitutive antigen is 
considered to be "abnormally high" when it is preferably about two times or more higher than 
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the ratio found in a set of samples taken from normal subjects or from normal cells, as 
expressed by a mean value found therein. The present invention also provides diagnostic 
kits including the above novel antibodies for purposes of carrying out this method of 
detection. 

5 The predicted N-terminal sequence of the PB39 protein, as shown in SEQ ID NOs:2 

and 4, suggests the presence of a signal-recognition particle sequence for a secreted protein. 
If the PB39 protein is in fact secreted, its concentration may be increased in the serum early 
in the progression of prostate cancer, in prostatic intraepithelial neoplasia (PIN) for example. 
Identification of a serum protein characteristic of prostate cancer would be a useful tool for 

l o the early detection of this disease. Epidemiologic studies have shown that PIN precedes the 
development of prostate cancer by several decades in most men. Thus a marker of early 
malignancy could identify those men who develop PIN lesions early in life and are at greatest 
risk for developing clinically significant disease (Bostwick, D.G. et aL Molecular biology of 
prostatic intraepithelial neoplasia, Prostate, 29: 117-134 (1996)). 

] 5 The PB39 gene described in the present invention is in general dysregulated in 

disease states such as prostate cancer. It may be subject to altered expression (i.e., 
overexpression or underexpression) and/or altered processing, resulting in a change in the 
level of the expressed protein in such a disease state. 

20 EXAMPLES 

Example 1 . Microdissection of cancerous prostate glands and iso lation of RNA. 

Patient samples . All tissue samples were obtained from radical prostatectomy 
specimens from either the Mayo Clinic (Rochester, MN) or the National Cancer Institute 

25 (NCI) (Bethesda, MD). Samples were snap-frozen within minutes after surgery and stored at 
-70 °C until use. Unstained 12-um frozen tissue sections were dissected under microscopic 
visualization as previously described (Emmert-Buck M. et al., Increased gelatinase A and 
cathepsin B activity in invasive tumor regions of human colon cancer samples, Am J Pathol 
145: 1285-1290 (1994); Zhuang Z. et al., A microdissection technique for archival DNA 

30 analysis of specific cell populations in lesions <1 mm in size, Am J Pathol 146: 620-625 

(1995)). Essentially pure populations of normal epithelium and invasive tumor were dissected 
in each case. An adjacent hematoxylin and eosin-stained section was used as a guide to 
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ensure accuracy of dissection. All dissections were completed within 30 minutes of 
preparation of the frozen tissue sections. 

RNA Isolation . Approximately 5000 to 10,000 normal epithelial or tumor cells were 
microdissected for each sample. A scaled down version of the Stratagene (La Jolla, CA) 
5 RNA Microisolation procedure was used to isolate RNA (Chomczynski P. et aL Single-step 
RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal 
Biochem 162: 156-159 (1987)). After resuspension of the RNA pellet, a DNAse step and re- 
extraction was performed using the MessageClean™ kit from GenHunter (Nashville. TN) 
according to the manufacturer's instructions. 

10 

Example 2. Reverse Transcription and Polymerase Chain Reaction Amplification of the 
Prostate-Specific Transcript . 

Complementary DNA (cDNA) was obtained by reverse transcription (RT) using the 
RNAamp™ kit from GenHunter except that 2.5 jiM random hexamer primers from Perkin- 

15 Elmer (Norwalk, CT) were used instead of the primers supplied. The final mixture was 

treated as follows: 65 °C for 5 minutes, 25 °C for 10 minutes, then 1 uL of Moloney Murine 
Leukemia Virus with reverse transcriptase activity (MMLV) (GenHunter) was added and 
incubated at 25°C for 10 minutes, 37°C for 40 minutes, and 94°C for 5 minutes. Each RT 
reaction generated about 20 |iL solution containing about 0.5 -1 ng of cDNA. For each RNA 

20 sample, a negative control was done for the RT reaction, replacing the MMLV with 1 uL of 
water. 

Arbitrary Zinc Finger Polymerase Chain Reaction fPCRY Several PCR reactions 
utilizing arbitrary primers from the RAP-PCR kit from Stratagene and degenerate zinc finger 
primers (Stone B. et al., Targeted RNA fingerprinting: the cloning of differentially-expressed 

25 cDNA fragments enriched for members of the zinc finger gene family, Nucl Acid Res 22: 
2612-2618 (1 994)) were run to assess differences in gene expression between normal 
epithelium and invasive tumor in NCI patient 1 542 (Table 1, case 2; see Example 3). PCR 
conditions were systematically varied to maximize the reproducibility of bands present on 
denaturing electrophoresis gels. The specific PCR primers that generated expressed sequence 

30 tag (EST) clone R00504 (see below) were: Primer 3 : Stratagene arbitrary primer A2, 
AATCTAGAGCTCCAGCAG (SEQ ID NO:5), 
and Primer 2: zinc 2, 
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GTCGTCGAATTCCACACAGGAGAAAAGCC (SEQ ID NO:6). 
PCR conditions were: 1 cycle of 94 °C for 2 minutes, followed by 35 cycles of 94°C for 30 
seconds, 50 °C for 30 seconds, 72°Cfor 1 minute, and 1 cycle of 72°C for 10 minutes. 

Gel Electrophoresis . Labeled amplified DNA was mixed with an equal volume of 
formamide loading dye (95% formamide; 20 mM ethylenediaminetetraacetic acid; 0.05% 
bromophenol blue, and 0.05% xylene cyanol). The samples were denatured for 5 minutes at 
94°C and loaded onto a gel consisting of 6% acrylamide (49:1 acrylamide/bis). Bands in the 
gels were transferred to 3-mm Whatman paper, the paper was dried, and autoradiography was 
performed with Kodak X-OMAT film. 

Results . Normal prostatic epithelial and tumor RNA samples from NCI patient 1542 
(Table 1, case 2; see Example 3) were analyzed by the low-stringency RT-PCR procedure 
outlined above using arbitrary and zinc finger primers. Several parameters, including the 
identities of the RT primers and PCR primer sets (sizes and sequences), as well as reaction 
conditions, were varied in an attempt to elicit differences in gene transcription between 
normal and tumor cells. In general, comparison of normal and tumor samples produced 
identical patterns of gene expression; several hundred PCR products were observed that did 
not vary between normal and tumor cells. 

However, PCR with primers A2 (SEQ ID NO:5) and zinc 2 (SEQ ID NO:6) resulted 
in the presence of a strong product selectively in the tumor sample (Figure 1, Panel A). 
Separate PCR reactions in which only one of the primers was used did not produce a 
similarly sized band. The band was extracted from the gel, reamplified, and subjected to 
partial sequencing. Direct sequencing was performed using the Amplicycle™ sequencing kit 
from Perkin-Elmer according to the manufacturer's instructions. The following 103-base pair 
(bp) sequence was obtained: 

5' ACAGGAATCC CCAGGAGTGA AGAATAAGCA GGAGGCCCCA 
GATTCACCTT TAGGGCAAGG AGAGAGAAAC AGAGTCAAGT AGGTAGTCAT 
CTGCCCTTAA GCC 3' ( SEQ ID NO:9). 

Analysis showed a match of 102 bp out of the 103 bp to a gene sequence in the expressed 
sequence tag (EST) database (GenBank accession R00504). 

The patient from whom this sample was obtained was a 47-year-old black man who 
first presented with localized (Stage T2A) prostate cancer. Histopathologic examination of 
the prostatectomy specimen showed a poorly differentiated adenocarcinoma (Gleason score 



WO 00/05376 PCT/US99/16831 
8). The patient was clinically free of disease until 1 year postoperatively, when he developed 
rapidly rising prostate-specific antigen levels and clinical evidence of recurrent disease. 



Example 3. Analysis of Differential Expression of R00504 . 

Messenger RNA levels of R00504 were determined in normal prostate epithelium and 
corresponding invasive tumor cells from a test panel of 10 prostate carcinoma samples (total 
of 20 samples. Total RNA was recovered from each sample, treated with DNAse, and 
R00504 was amplified by RT-PCR. The level of the beta-actin gene from each sample, 
likewise obtained by RT-PCR, was used as an internal standard to quantitate R00504 levels. 
For the PCR of the beta-actin gene, 1 jiL of the cDNA sample was subjected to PCR with 
specific primers from Clontech (Palo Alto, CA) according to the manufacturer's instructions. 
All reactions were run at least twice to ensure reproducibility of results, and a control 
reaction without reverse transcriptase was run in parallel for all normal and tumor samples. 

To analyze differential expression, R005Q4-specific primers were used: 

Primer 1 = GCATGTTACAGGTAGAAAAGCC (SEQ ID NO:7); 

Primer 2 - CTGGCGTATCTGAAGAGTCTG (SEQ ID NO:8). 
PCR conditions using these primers were as follows: 1 cycle of 94°C for 2 minutes, followed 
by 35 cycles of 94°C for 30 seconds, 60°C for 1 minute, 72°C for 1 minute, and 1 cycle of 
72°C for 10 minutes. Beta-actin and R00504 PCR reactions included dilutions of samples to 
ensure that reactions were not at saturation conditions. All reaction sets were run a minimum 
of two times to ensure reproducibility of results, and samples were run in parallel with a 
negative reverse transcriptase control. Overexpression of R00504 was determined visually 
and was considered to be present when R00504 was selectively expressed in tumor cells or 
substantially increased in tumors compared with beta-actin expression (Figure 1). 

Normal prostatic epithelium and corresponding invasive tumor were microdissected 
from a test panel of 10 prostate carcinoma samples. Using the specific R00504 PCR primers 
given by SEQ ID NOs:7 and 8, samples from NCI patient 1542 showed strong expression of 
R00504 in the tumor sample but no expression in the normal epithelial sample, consistent 
with the initial experiment utilizing primers A2 and zinc 2 (SEQ ID NOs:5 and 6; see 
Example 2). 

Table 1 presents the results obtained probing the ten patient samples for R00504. 
Five of the patients in the test panel showed substantial overexpression of R00504 in the 
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tumor samples (Table K cases 1. 2. 6. 9. 10). Figure 1, Panel B shows examples of two cases 
demonstrating tumor-specific increases in R00504 levels. Case 1 shows expression of 
R00504 in both the normal and tumor samples, with a relative increase in expression in the 
tumor. Case 2 shows selective expression in the tumor sample. Overexpression of R00504 
5 in normal cells relative to the corresponding tumor was not observed in any of the cases. 

These results indicate that R00504 overexpression occurs frequently in prostate cancer. The 
clinical parameters of the tumors varied among patients (Table 1), and no correlation between 
R00504 expression and clinical features of the tumors was found. However, the majority of 
the samples was from patients who had undergone surgery only a short time prior to these 
i 0 studies, so that minimal follow-up data are available. 

Example 4. Northern Blot Analysis of R005Q4 Transcription . 
Q Northern blots were performed to assess the size of the transcript in tissues. In order 

+; to serve as a probe, R00504 was amplified and labeled by PCR with sequence-specific 

or} 5 primers using 32 P-deoxycytidine triphosphate and the PCR conditions described in Example 
\ 5 3. The labeled PCR product was used to probe samples from various human fetal tissues 

^ (obtained from Clontech) according to the manufacturer's recommendations. Two million 

counts per minute per milliliter of hybridization solution was applied, 
f ^ The Northern blots containing RNA from fetal kidney, liver, lung, and brain tissues 

Clo showed a single band of approximately 2.6 kilobases (Figure 2). Expression was highest in 

the fetal liver sample, consistent with the fact that the initial identification of the R00504 

EST used a cDNA library from fetal liver. 

The full gene represented by the RNA identified in this Example, and which contains 

the nucleotide sequence provided in Example 2 as part of EST R00504, is termed PB39 
25 herein. 

Example 5. Northern blot analysis for PB39 in adult and fetal human tissues . 

Northern blot analysis using labeled R00504 as a probe showed a transcript of 
approximately 2.6 kb which was expressed in tissues of the adult colon, small intestine, 
30 ovary, prostate, spleen, and pancreas (Figure 3, Panel A), fetal kidney, liver, and lung (Figure 
3, Panel B), and adult liver and skeletal muscle (results not shown). The level of expression 
was highest in adult pancreas tissue (Figure 3, Panel A). 
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Example 6. Nucleotide sequence of PB39, 

To determine the complete nucleotide sequence of the PB39 cDNA, specific 5' and 3' 
PB39 primers were generated and used to isolate PB39 specific PCR products from a human 
pancreas cDNA library using the Rapid Amplification of cDNA Ends (RACE) method 
5 (Marathon-Ready cDNA, Clontech). The 3 1 and 5 l PB39 RACE primer sequences 
GACCGCATAGACTTCTCAGA (SEQ ID NO: 10) and 
GCATGTTACAGGTAGAAAAGCC (SEQ ID NO:7), 

respectively, were chosen from EST clone R00504. A 700 bp 3' fragment and a 2 kb 5' 
fragment were produced, subcloned into a plasmid pCR vector, and cycle sequenced. To 

10 validate the sequence, gene-specific PCR products amplified from the pancreas library were 
directly sequenced, and verified by 1 0 independent sequencing reactions. Assembly of the 
entire set of sequences produced a 23 1 7 nucleotide cDN A sequence (SEQ ID NO; 1) that 
includes 76 nucleotides of 5' untranslated sequence, a 1677 nucleotide open reading frame 
(559 amino acids), and 564 nucleotides of 3' untranslated sequence (Figure 4, Panels A and 

15 B). The consensus Kozak sequence GCCGCCATGG placed the translation initiation 

methionine at nucleotide position 77 (Kozak ML, Structural features in eukaryotic mRNAs 
that modulate the initiation of translation, J Biol Chem 266: 19867-19870 (1991)). 

Example 7. Nucleotide sequence of a variant of PB39. 

20 Basic Local Alignment Search Tool (BLAST) analysis (Altschul, S. F. et al., Gapped 

BLAST and PSI-BLAST: A new generation of protein database search programs, Nucleic 
Acids Res 25:3389-3402 (1997)) of the PB39 sequence against the human EST database 
showed multiple PB39 EST clones from diverse tissue types. Interestingly, several PB39 
homologous clones showed an identical divergence within the coding sequence at nucleotide 

25 1610 (Figure 4, panel B) with introduction of an additional nucleotide sequence. To analyze 
this longer, alternative form of PB39 further, a PCR primer specific for the inserted sequence 
(TCTGCAAAGTGGCTGAGATGAG (SEQ ID NO:l 1 )) was designed and used to amplify 
cDNA from the pancreas library, together with a PB39-specific 5' primer 
(CCTGCCTTATCTTTCTGAACTGCACC (SEQ ID NO: 12)). The amplified product was 

30 isolated and directly sequenced. The sequence is shown in Figure 4, panels A and B (SEQ 
ID NO:3). Analysis of the open reading frame shows the addition of 48 new amino acids 
beginning at nucleotide position 1613, followed by a stop codon. Thus, the larger transcript 
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of PB39 encodes a 560 amino acid protein in which the 47 C-terminal amino acids found in 
the 2.3 kb PB39 are replaced by 48 new amino acids (Figure 4, panels A and B). A 
schematic diagram of the relationship between the two RNA species is shown in Figure 4, 
Panel C. 

5 Northern blot analysis using a probe specific for the inserted sequence showed a 5kb 

PB39 transcript expressed in adult pancreas and fetal liver tissue. As expected, this probe did 
not hybridize to the 2.6 kb PB39 (Figure 5). A longer exposure of the R00504-probed blots 
did reveal a less intense transcript at 5kb, which would be expected since this sequence is 
common to both transcripts (Figure 3, panels A and B). 

10 RT-PCR analysis was performed to study the expression of the 5 kb PB39 transcript 

in human prostate tissue as described above (Example 3). RT-PCR using primers directed 
against the inserted sequence in microdissected normal and invasive prostate epithelium 
showed a product in 4 of 4 tumor samples, but only 1 of 4 corresponding normal samples 
(results not shown). One of the cases over-expressing the 5 kb transcript did not show over- 

1 5 expression of the 2.6 kb form of PB39. Since the results 
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in Table 1 show overexpression of PB39 in 5 of 1 0 tumor samples, it appears that both the 
2.6 kb and 5 kb forms of PB39 are up-regulated in unique subsets of tumors. The 
physiological significance of this finding is not yet clear. 

The 5 kb PB39 variant transcript was found to match EST clones primarily from fetal 
5 and tumor tissue libraries (Figure 4, panel C). This transcript was also found to be highly 
expressed in a cDNA library from a microdissected prostatic intraepithelial neoplasia focus 
which was sequenced as part of the Cancer Genome Anatomy Project. Thus, PB39 represents 
one of the first identified genes whose expression has been shown to be increased early in 
prostate cancer development. The cellular regulation of PB39 mRNA splice variants, their 
10 precise expression levels during prostate tumorigenesis, and the functional significance of 
altering the C-terminal 47 amino acids remain to be determined. 

I'l Example 8. Localization of PB39 on chromosome 1 1 . 

;P A Whitehead Institute Sequence Tagged Site marker WM 7004 (GenBank acc. 

□15 G22380) maps PB39 to 291 .1 cR from the telomere of the short arm of chromosome 1 1 

j 5! (Schuler GD, Sequence mapping by electronic PCR, Genome Res 7: 541-550 (1997)). To 

confirm this result, in situ hybridization was performed following protocols previously 

\2 described (Pinkel, D. et aL, Fluorescence in situ hybridization with human chromosome- 

;t specific libraries: detection of the trisomy 21 and translocations of chromosome 4, Proc Natl 

q|>0 Acad Sci USA. 85:9138-9142 (1988); Hirai, M. et aL, A method for simultaneous detection 

fij. 

of fluorescent G-bands and in situ hybridization signals, Cytogenet Cell Genet 66: 149-151 
(1994)). The chromosomal localization of the gene was determined by hybridization of the 2 
kb 5 1 RACE PCR product to metaphase chromosomes and converting DAPI banding to "G- 
banding" using IP Lab Spectrum™ (Scan Analytics, Fairfax, VA) software for chromosomal 

25 identification. This result indicates that the PB39 gene maps to human chromosome 1 lpl 1 . 1- 
1 1 .2. A total of 50 cells was examined to determine the precise chromosomal location of the 
probe. In all metaphases scored, clear signals were seen on the short arm of chromosome 1 1 
(Figure 6, panels A and B). DAPI banding unambiguously showed the position of the signal 
in the region 1 lpl 1.1-1 K2 (Figure 6, panels A, B, C and D). Both the STS primers and the 

30 fluorescence in situ hybridization (FISH) probe were directed against sequences common to 
the 2.6 kb and 5kb transcripts. In both cases hybridization to only one chromosomal location 
was identified. Interestingly, the human chromosome 1 lpl 1-1 lpl2 region has been 
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postulated to harbor one or more metastasis suppressor genes, including KAIL and has also 
been shown to be deleted in 70% of advanced prostate cancers (Dong. J.T. et al. s KAI1, A 
metastasis suppressor gene for prostate cancer on human chromosome 1 lpl L Science 
268:884-886 (1995); Kawana, Y. et al, Location of KAI1 on the short arm of human 
5 chromosome 1 1 and frequency of allelic loss in advanced prostate cancer, Prostate, 32: 205- 
213 (1997)). 

Example 9. Expression of PB39 in COS cells, 

A cDNA encoding PB39 is to be ligated into a vector that can be used to transfect a 

10 mammalian cell. In particular, SV40 vectors may be employed for this purpose. The PB39 
gene is to be subcloned into an appropriate plasmid such that suitable restriction sites become 
available. The cDNA is then to be extracted using the appropriate restriction enzyme, and 
ligated into a complementary site in the SV40 genome. 

The recombinant SV40 is to be used to transfect, for example, simian COS cells. The 

15 vector is combined with calcium phosphate for incorporation into the cells. Amplification 
using vectors incorporating the drug resistance gene dhfr permits larger amounts of the 
cloned gene to accumulate upon exposure of the culture to methotrexate, and to be expressed 
in the cultured cells. Upon completion of the selection, the amplified cells are to be cultured 
to produce the recombinant PB39 gene product. 

20 PB39 protein is to be isolated and purified from the cultured cells. The cells are to be 

harvested, and gently disrupted to release the cellular contents into the medium. PB39 is to 
be purified using known methods including differential centrifugation. and various forms of 
column chromatography, such as ion exchange, hydrophobic interaction, and size exclusion. 
Purity is to be assessed using analytical chromatographic procedures, sodium dodecyl sulfate- 

25 polyacrylamide gel electrophoresis and/or like procedures. 

Example 10. Production of polyclonal anti-PB39 antibodies. 

Putative epitopic peptide sequences were synthesized in order to elicit polyclonal 
antibodies specific for PB39. Peptides common to both the 2.6 kb encoded protein and the 
30 variant transcript protein are: 

peptide 644: Thr Gin Asp Glu Gin Arg Arg Trp Pro Gly Cys Asp Gin Gin (SEQ ID 
NO:13), 
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peptide 645: Glu Asn Leu Pro Glu Arg Ser Val Pro Leu Arg Lys Ser Leu (SEQ ID 
NO: 14), and 

peptide 646: Arg Pro Arg Tyr Cys Lys He Gin Lys Leu Thr Asn Ala(SEQ ID 
NO:15). 

5 A peptide sequence specific for the 2.6 kb encoded protein is 

peptide 655: Ala Asn Gly Met Gly Pro Leu Lys Val Leu Ser Gly Ser (SEQ ID 
NO: 16). 

A peptide sequence specific for the variant transcript protein is 

peptide 656: Ala Arg Gly Thr Ser Glu Val Ser Asn Leu Gin Val Ser (SEQ ID NO: 1 7). 
] 0 Rabbits were immunized with a BS A conjugate of each of the above peptides, using 

complete Freund's adjuvant. Immunizations weTe done every two weeks. The rabbits were 
bled prior to the immunizations and again during the intervals between injections, 
y The anti -peptide antibody sera were affinity purified over the corresponding peptide- 

Ly conjugated Affl-Gel 10™ columns. After desorption from the column, the antibody- 

21 5 containing eluates were concentrated for storage and use. 

Ul The binding of the resulting antibodies was assessed in western blots. For testing of 

□ antibodies raised against peptides 655 and 656 30 ^g of protein extracted from a prostate 

\Z cancer cell line (1 542/8.4 clone) were analyzed, For testing of antibodies raised against 
y ! peptides 644, 645 and 646, protein extracted from roughly 24,000 cells from microdissected 
f|20 tumor samples were analyzed. Proteins were separated on a 8-12% gradient P. A.G.E. gel 
(NOVEX) run in tris glycine and transferred to a PVDF membrane. The membranes were 
blocked and incubated overnight at 4°C with varying titers of the anti-peptide antibody 
mixtures (range from 1 : 1 0 to 1:5000, volume/volume). The membranes were washed and 
incubated with an HRP conjugated goat anti-rabbit secondary antibody. After washing, the 
25 membranes were incubated with a chemiluminescence substrate and visualized on X-OMAT 
film. For all antibodies tested, a 60 kD band was evident. This corresponds to the size 
anticipated for both the 2.6 kb encoded protein and the variant transcript. In Figure 7, Panels 
A, B, and C, show the results obtained with the anti-646, -644 and -656 antibodies, 
respectively. For the westerns probed with anti-655 and -645 antibodies, the bands were 
30 present, but were too faint to reproduce in a photograph. Antibody specificity was 

demonstrated by eliminating the bands upon competition with increasing concentrations of 
the corresponding epitopic peptide. 
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Example 1 1 . Monoclonal antibodies. 

Purified PB39, or BSA-conjugated peptides 646, 644 or 656, are to be injected into 
mice together with Freund's complete adjuvant. Subsequently spleen cells obtained from the 
mice are to be fused with immortalized murine tumor cells to produce hybridomas. Each 
hybridoma cell is to be expanded, and the supernatant from the culture of each clone is to be 
assessed for its ability to bind immunospecifically with authentic PB39 antigen. Clones 
identified as successfully secreting binding antibodies are to be retained and stored. 
Monoclonal anti-PB39 antibodies from such clones are obtained by culturing these clones. 

Example 12. ELISA assay for PB39 in prostate tissue. 

Prostate tissue is to be obtained from a subject. If desired, the tissue is to be 
microdissected to extract cancerous and normal epithelial cells. The tissue sample is to be 
homogenized. The wells of a microliter plate are to be contacted with polyclonal anti-PB39 
from Example 10 to adsorb the antibody, and remaining sites blocked with casein. Portions 
of the homogenate are then to be added to the wells of the plate. Aliquots of authentic PB39 
antigen at known concentrations are to be added to additional wells to serve as standards for 
quantitation of the amount of antigen. The wells are then to be treated with either a 
monoclonal anti-PB39 antibody from Example 10, or with a further aliquot of the polyclonal 
antibody. This second antibody will previously have been conjugated with horse radish 
peroxidase. The wells are then treated with the chromogenic substrate o-phenylenediamine 
dihydrochloride, and the resulting color is to be determined on a microtiter plate 
spectrophotometry detector. The absorbance values so obtained are calibrated against the 
PB39 standard for quantitating the amount of antigen present in the samples. Of course, 
other visualizing or detecting systems known in the art can be used if desired. 
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1. A purified and isolated nucleic acid comprising a sequence given in SEQ ID NO: L 
a sequence which is complementary to the sequence given in SEQ ID NO: 1 , a sequence 
given in SEQ ID NO:3. or a sequence which is complementary to the sequence given in SEQ 
ID NO:3. 

2. The nucleic acid described in claim 1, wherein the nucleic acid is an RNA. 

3. The nucleic acid described in claim 1, wherein the nucleic acid is a cDNA. 

4. A purified and isolated nucleic acid comprising a sequence that is a fragment of the 
sequence given in SEQ ID NO:l, the sequence which is complementary to the sequence 
given in SEQ ID NO: 1 , the sequence given in SEQ ID NO: 3, or the sequence which is 
complementary to the sequence given in SEQ ID NO:3, wherein the fragment hybridizes 
specifically with the sequence given in SEQ ID NO:l, the sequence which is complementary 
to the sequence given in SEQ ID NO: 1 , the sequence given in SEQ ID NO: 3, or the 
sequence which is complementary to the sequence given in SEQ ID NO:3. 

5. The nucleic acid described in claim 4, wherein the nucleic acid is selected from the 
group consisting of SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 1 1, and SEQ 
IDNO:12. 

6. A polypeptide encoded by a nucleic acid comprising the sequence given in SEQ ID 
NO:l or the sequence given in SEQ ID NO:3. 

7. The polypeptide described in claim 6, wherein the polypeptide is a recombinantly 
produced polypeptide. 

8. An antibody that binds immunospecifically with a polypeptide encoded by a 
nucleic acid comprising a sequence given in SEQ ID NO:l or a sequence given in SEQ ID 
NO:3. 
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9. A method of detecting precancerous cells or cancer cells in the prostate of a 
subject, said method comprising providing a sample of tissue or fluid from the subject and 
determining whether the sample contains an abnormally high content of a nucleic acid 
comprising a sequence given in SEQ ID NO:l , a sequence which is complementary to the 
sequence given in SEQ ID NO:l ; a sequence given in SEQ ID NO:3, a sequence which is 
complementary to the sequence given in SEQ ID NO:3, or a fragment of any of the 
sequences, whereby determining that the sample contains an abnormally high content of the 
nucleic acid indicates that the subject has precancerous cells or cancer cells in the prostate. 

10. The method described in claim 9, wherein the sample is a body fluid. 

1 1 . The method described in claim 9, wherein the sample is tissue originating from 
the prostate. 

12. The method described in claim 9, wherein the determining step comprises 
amplifying the nucleic acid and detecting the amplified nucleic acid. 

13. A method of detecting precancerous cells or cancer cells in the prostate of a 
subject, said method comprising providing a sample of tissue or fluid from the subject and 
determining whether the sample contains an abnormally high content of a polypeptide 
encoded by a nucleic acid comprising a sequence given in SEQ ID NO:l or SEQ ID NO:3. 
whereby determining that the sample contains an abnormally high content of the polypeptide 
indicates that the subject has precancerous cells or cancer cells in the prostate. 

14. The method described in claim 13, wherein the sample is a body fluid. 

15. The method described in claim 13, wherein the sample is tissue originating from 
the prostate. 
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16. The method described in claim 13, wherein the determining step further comprises 
contacting at least a portion of the sample with an antibody that binds immunospecifically 
with the polypeptide and determining the amount of the antibody that has bound with the 
polypeptide present in the sample. 
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Fiqure 4A 

■ ■ MWM ■ Mm Mi 






1 


ccggggctggaggggggcaagcgggttccgaggtgcaaagcctgg 








tgccccgagccctgcggagctcggggccagc 






77 


atqgcccccacgctgcaacaggcgtaccggagqcgctggtggatg 








MAPTLQQAYRRRWWM 


15 




122 


gcctgcacggctgtgctgg4gaacctcttcttctctgctgtactc 








ACTAVLENLFFSAVL 


30 




167 


ctgggctggggctccctgttgatcattctgaagaacgagggcttc 








LGWGSLLI ILKNEGF 


45 




212 


tattccagcacgtgcccagctgagagcagcaccaacaccacccag 








YSSTCPAESSTNTTQ 


60 




257 


gatgagcagcgcaggtggccaggctgtgaccagcaggacgagatg 








DEQRRWPGCDQQDEM 


75 




302 


ctcaacctgggcttcaccattggttccttcgtgctcagcgccacc 








LNLGFTI GSFVLSAT 


90 




347 


accctgccactggggatcctcatggaccgctttggcccccgaccc 








TLPLGILMDRFGPRP 


105 


&"'k 


392 


gtgcggctggttggcagtgcctgcttcactgcgtcctgcaccctc 








VRLVGSACFTASCTL 


120 


"%% 

if?? 


437 


atqgccctggcctcccgggacgtggaagctctgtctccgttgata 




' 7, I 




MALASRDVEALS PLI 


135 




482 


ttcctggcgctgtccctgaatggctttggtggcatctgcctaacg 




-51 




FLALSLNGFGGI CLT 


150 




527 


ttcacttcactcacgctgcccaacatgtttgggaacctgcgctcc 








FTS LTLPNMFGNLRS 


165 




572 


acgttaatggccctcatgattggctcttacgcctcttctgccatt 








TLMALMI GSYAS SAI 


180 




617 


acgttcccaggaatcaagctgatctacgatgccggtgtggccttc 




fi 




TFPGIKLIYDAGVAF 


195 


«t 


662 


gtggtcatcatgttcacctggtctggcctggcctgccttatcttt 




5 i 




VVIMFTWSGLACLIF 


210 




707 


ctgaactgcaccctcaactggcccatcgaagcctttcctacccct 








LNCTLNWP IEAFPAP 


225 




752 


gaggaagtcaattacacgaagaagatcaagctgagtgggccggcc 








EEVNYTKKIKLS GLA 


240 




797 


ctggaccacaaggcgacaggtgaccCcttctacacccagtgacc 








LDHKVTGDLFYTHVT 


255 




842 


accacgggcagaggctcagccagaaggcccccagcctggaggac 








TMGQRLSQKAPSLED 


270 




887 


ggttcggatgccttcatgtcaccccaggatgttcggggcacctca 








GSDAFMSPQDVRGTS 


285 




932 


gaaaaccttcctgagaggtctgtccccttacgcaagagcctctgc 








ENLPERSVPLRKSLC 


300 




977 


tcccccactttcctgtggagcctcctcaccatgggcatgacccag 








SPTFLWSLLTMGMTQ 


315 




1022 


ctgcggatcatcttctacatggctgctgtgaacaagatgctggag 








LRI I FYMAAVNKMLE 


330 




1067 


taccttgtgactggtggccaggagcatgagacaaatgaacagdaa 








YLVTGGQEHETNEQQ 


345 




1112 


caaaaggtggcagagacagttgggttctactcctccgtcttcggg 








QKVAETVGFYS SVFG 


360 




1157 


gccatgcagctgttgtgccttctcacctgccccctcattggctac 








AMQLLCLLTCPLIGY 


375 
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1202 a tcatggac tggcggatcaaggac tgcgtggacgccccaac tcag 

IMDWRIKDCVDAPTQ 390 
1247 ggcac tg tec teggaga tgccagggacggggt tgc taccaaa t cc 

GTVLGDARDGVATKS 405 
1292 atcagaccacgctactgcaagatccaaaagctcaccaatgccatc 

IRPRYCKIQKLTNAI 420 
1337 agtgccttcaccctgaccaacctgctgcttgtgggttttggcatc 

SAFTLTNLLLVGFGI 435 
acctgtctcatcaacaacttacacctccagtttgtgacctttgtc 
TCLINNLHLQFVTFV 450 
1427 ctgcacaccattgttcgaggtttc ttccdctcagcc tgtgggagt 

LHTIVRGFFHSACGS 465 
1472 ctctatgctgcagtgttcccatccaaccactttgggacgctgaca 

LYAAVFPSNHFGTLT 480 
1517 ggcctgcagtccctcatcagtgctgtgttcgccttgcttcagcag 

GLQSLISAVFALLQQ 495 
1562 ccacttttcatggcgatggtgggacccctgaaaggagagcoctcc 

PLFMAMVGPLKGEPF 510 
1607 tgggtgaatctgggcctcctgctattctcactcctgggattcctg 
agagcgagggttggtgtggggggagcaggagccactctc 
WVNLGLLLFS LLGFL 525 
RARVGVGGAGATL 525 
ttgccttcctacctcttctattaccgtgcccggctccagcaggag 
ctgggggcaggggtagggccttgtatgtggtgccatccctcactc 
LPSYLFYYRARLQQE 540 
LGAGVGPCMWCHPSL 540 
1697 tacgccgccaatgggatgggcccactgaaggtgcttagcggctct 
atctcagccagaggcacctcagaggtctctaatccgcaggtttcc 
YAANGMGPLKVLSGS 555 
I SARGT SEVSNLQVS 555 
1742 gaggtgaccgea tag 

aagttgtctgccttttag 
E V T A * 559 
K L S A F *#560 



1652 



acttctcagaccaagggacctggatgacaggcaatcaaggcctga 
gcaaccaaaaggagtgccccatatggcttttctacctgcaacacg 
cacatagagccatggccgtagatttataaataccaagagaagttc 
tatttttgtaaagactgcaaaaaggaggaaaaaaaaccttcaaaa 
acgccccctaagtcaacgctccattgactgaagacagcccctatl 
ctagaggggttgagctttcttcctccttgggttggaggagaccag 
ggtgcctcttatctccttctagcggtctgcctcccggcacctctt 
ggggggatcggcaaacaggctacccctgaggtcccatgtgccatg 
agtgtgcacaacatgcaatgtgtctgtgtatgtgtgccatgaatg 
tgagaaaaacacagccctcctttcagaaggaaaggggcctgaggg 
ctgtgtcctgggttaggggttgggggtcggccccttccagggcca 
ggaaggcaggttccctctctggtgctgctgcttgcaagtcttaga 
ggaaataaaaagggaagtgag aaaaaaaaa 

# there are approximately an additional 2300 nucleotides 
at this site contributing to the 5 kb transcript's 3'UTR. 
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SEQUENCE LISTING 

JC07Rec'd PCT/PTO } 6 JAN 2001 

<110> Chuaqui, Rodrigo F. 
Cole, Kristina A. 
Liotfca, Lance A. 



<12 0> PB3 9, A Gene Overexpressed in Prostate 
Cancer, and Uses Thereof 



<130> 6299/62051 
<160> 17 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 2326 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (77) . . . (1753) 
<400> 1 

ccggggctgg aggggggcaa gcgggttccg aggtgcaaag cctggtgccc cgagccctgc 6 0 

ggagctcggg gccagc atg gcc ccc acg ctg caa cag gcg tac egg agg cgc 112 
Met Ala Pro Thr Leu Gin Gin Ala Tyr Arg Arg Arg 
15 10 

tgg tgg atg gcc tgc acg get gtg ctg gag aac etc ttc ttc tct get 160 
Trp Trp Met Ala Cys Thr Ala Val Leu Glu Asn Leu Phe Phe Ser Ala 
15 20 25 

gta etc ctg ggc tgg ggc tec ctg ttg ate att ctg aag aac gag ggc 20 8 

Val Leu Leu Gly Trp Gly Ser Leu Leu lie lie Leu Lys Asn Glu Gly 
30 35 40 

ttc tat tec age acg tgc cca get gag age age ace aac ace acc cag 256 
Phe Tyr Ser Ser Thr Cys Pro Ala Glu Ser Ser Thr Asn Thr Thr Gin 
45 50 55 60 

gat gag cag cgc agg tgg cca ggc tgt gac cag cag gac gag atg etc 3 04 

Asp Glu Gin Arg Arg Trp Pro Gly Cys Asp Gin Gin Asp Glu Met Leu 
65 70 75 

aac ctg ggc ttc acc att ggt tec ttc gtg etc age gcc acc acc ctg 3 52 

Asn Leu Gly Phe Thr lie Gly Ser Phe Val Leu Ser Ala Thr Thr Leu 
80 B5 90 

cca ctg ggg ate etc atg gac cgc ttt ggc ccc cga ccc gtg egg ctg 4 00 

Pro Leu Gly lie Leu Met Asp Arg Phe Gly Pro Arg Pro Val Arg Leu 
95 ioo 105 

gtt ggc agt gcc tgc ttc act gcg tec tgc acc etc atg gcc ctg gcc 448 
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Val Gly Ser Ala Cys Phe Thr Ala Ser Cys Thr Leu Met Ala Leu Ala 
110 115 120 

tec egg gac gtg gaa get ctg tct ccg ttg ata ttc ctg gcg ctg tec 496 
Ser Arg Asp Val Glu Ala Leu Ser Pro Leu He Phe Leu Ala Leu Ser 
125 130 135 140 

ctg aat ggc ttt ggt ggc ate tgc eta acg ttc act tea etc acg ctg 544 
Leu Asn Gly Phe Gly Gly He Cys Leu Thr Phe Thr Ser Leu Thr Leu 
145 150 155 

ccc aac atg ttt ggg aac ctg cgc tec acg tta atg gec etc atg att 592 
Pro Asn Met Phe Gly Asn Leu Arg Ser Thr Leu Met Ala Leu Met He 
160 165 170 

ggc tct tac gec tct tct gec att acg ttc cca gga ate aag ctg ate 64 0 

Gly Ser Tyr Ala ser ser Ala He Thr Phe Pro Gly He Lys Leu He 

175 180 185 

tac gat gee ggt gtg gec ttc gtg gtc ate atg ttc ace tgg tct ggc 688 
Tyr Asp Ala Gly Val Ala Phe Val Val He Met Phe Thr Trp Ser Gly 
190 195 200 

ctg gec tgc ctt ate ttt ctg aac tgc ace etc aac tgg ccc ate gaa 736 
Leu Ala Cys Leu He Phe Leu Asn Cys Thr Leu Asn Trp Pro He Glu 
205 210 215 220 

gec ttt cct gec cct gag gaa gtc aat tac acg aag aag ate aag ctg 784 
Ala Phe Pro Ala Pro Glu Glu Val Asn Tyr Thr Lys Lys He Lys Leu 
225 230 235 

agt ggg ctg gec ctg gac cac aag gtg aca ggt gac etc ttc tac acc 832 
Ser Gly Leu Ala Leu Asp His Lys Val Thr Gly Asp Leu Phe Tyr Thr 
240 245 250 

cat gtg acc acc atg ggc cag agg etc age cag aag gee ccc age ctg 88 0 

His Val Thr Thr Met Gly Gin Arg Leu Ser Gin Lys Ala Pro Ser Leu 
255 260 265 

gag gac ggt teg gat gec ttc atg tea ccc cag gat gtt egg ggc acc 928 
Glu Asp Gly Ser Asp Ala Phe Met Ser Pro Gin Asp Val Arg Gly Thr 
270 275 280 

tea gaa aac ctt cct gag agg tct gtc ccc tta cgc aag age etc tgc 976 
Ser Glu Asn Leu Pro Glu Arg Ser Val Pro Leu Arg Lys Ser Leu Cys 
285 290 295 300 

tec ccc act ttc ctg tgg age etc etc acc atg ggc atg acc cag ctg 1024 
Ser Pro Thr Phe Leu Trp Ser Leu Leu Thr Met Gly Met Thr Gin Leu 
305 310 315 

egg ate ate ttc tac atg get get gtg aac aag atg ctg gag tac ctt 1072 
Arg He He Phe Tyr Met Ala Ala Val Asn Lys Met Leu Glu Tyr Leu 
320 325 330 

gtg act ggt ggc cag gag cat gag aca aat gaa cag caa caa aag gtg 1120 
Val Thr Gly Gly Gin Glu His Glu Thr Asn Glu Gin Gin Gin Lys Val 
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335 340 345 

gca gag aca gtt ggg ttc tac tec tec gtc ttc ggg gec atg cag ctg 1168 

Ala Glu Thr Val Gly Phe Tyr Ser Ser Val Phe Gly Ala Met Gin Leu 

350 355 360 

ttg tgc ctt etc acc tgc ccc etc att ggc tac ate atg gac tgg egg 1216 

Leu Cys Leu Leu Thr Cys Pro Leu lie Gly Tyr lie Met Asp Trp Arg 

365 370 375 380 

ate aag gac tgc gtg gac gec cca act cag ggc act gtc etc gga gat 1264 

lie Lys Asp Cys Val Asp Ala Pro Thr Gin Gly Thr Val Leu Gly Asp 

385 390 395 

gec agg gac ggg gtt get acc aaa tec ate aga cca cgc tac tgc aag 1312 

Ala Arg Asp Gly Val Ala Thr Lys Ser lie Arg Pro Arg Tyr Cys Lys 

400 405 410 

ate caa aag etc acc aat gec ate agt gee ttc acc ctg acc aac ctg 1360 

lie Gin Lys Leu Thr Asn Ala lie Ser Ala Phe Thr Leu Thr Asn Leu 

415 420 425 

ctg ctt gtg ggt ttt ggc ate acc tgt etc ate aac aac tta cac etc 1408 

Leu Leu Val Gly Phe Gly lie Thr Cys Leu lie Asn Asn Leu His Leu 

430 * 435 440 

cag ttt gtg acc ttt gtc ctg cac acc act gtt cga ggt ttc ttc cac 1456 

Gin Phe Val Thr Phe Val Leu His Thr lie Val Arg Gly Phe Phe His 

445 450 455 460 

tea gec tgt ggg agt etc tat get gca gtg ttc cca tec aac cac ttt 1504 

Ser Ala Cys Gly Ser Leu Tyr Ala Ala Val Phe Pro Ser Asn His Phe 

465 470 475 

ggg acg ctg aca ggc ctg cag tec etc ate agt get gtg ttc gec ttg 1552 

Gly Thr Leu Thr Gly Leu Gin Ser Leu lie Ser Ala Val Phe Ala Leu 

480 485 490 

ctt cag cag cca ctt ttc atg gcg atg gtg gga ccc ctg aaa gga gag 16 00 

Leu Gin Gin Pro Leu Phe Met Ala Met Val Gly Pro Leu Lys Gly Glu 

495 500 505 

ccc ttc tgg gtg aat ctg ggc etc ctg eta ttc tea etc ctg gga ttc 1648 

Pro Phe Trp Val Asn Leu Gly Leu Leu Leu Phe Ser Leu Leu Gly Phe 

510 515 520 

ctg ttg cct tec tac etc ttc tat tac cgt gec egg etc cag cag gag 1696 

Leu Leu Pro Ser Tyr Leu Phe Tyr Tyr Arg Ala Arg Leu Gin Gin Glu 

525 530 535 540 

tac gee gec aat ggg atg ggc cca ctg aag gtg ctt age ggc tct gag 1744 

Tyr Ala Ala Asn Gly Met Gly Pro Leu Lys Val Leu Ser Gly Ser Glu 

545 550 555 

gtg acc gca tagacttctc agaccaaggg acctggatga caggcaatca 1793 
Val Thr Ala 
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aggcctgagc aaccaaaagg agtgccccat atggcttttc tacctgtaac atgcacatag 1853 

agccatggcc gtagatttat aaataccaag agaagttcta tttttgtaaa gactgcaaaa 1913 

aggaggaaaa aaaaccttca aaaacgcccc ctaagtcaac gctccattga ctgaagacag 1973 

tccctatcct agaggggttg agctttcttc ctccttgggt tggaggagac cagggtgcct 2033 

cttatctcct tctagcggtc tgcctcctgg tacctcttgg ggggatcggc aaacaggcta 20 93 

cccctgaggt cccatgtgcc atgagtgtgc acaacatgca atgtgtctgt gtatgtgtga 2153 

atgtgagaaa aacacagccc tcctttcaga aggaaagggg cctgaggtgc cagctgtgtc 2213 

ctgggttagg ggttgggggt cggccccttc cagggccagg aaggcaggtt ccctctctgg 22 73 

tgctgctgct tgcaagtctt agaggaaata aaaagggaag tgagaaaaaa aaa 2326 



<210> 2 

<211> 559 

<212> PRT 

<213> Homo sapiens 



<400> 2 
Met Ala Pro Thr 
1 

Cys Thr Ala Val 
20 

Trp Gly Ser Leu 
35 

Thr Cys Pro Ala 
50 

Arg Trp Pro Gly 
65 

Thr lie Gly Ser 

Leu Met Asp Arg 
100 

Cys Phe Thr Ala 
115 

Glu Ala Leu Ser 
130 

Gly Gly lie Cys 
145 

Gly Asn Leu Arg 

Ser Ser Ala lie 
180 

Val Ala Phe Val 
195 

lie Phe Leu Asn 
210 

Pro Glu Glu Val 
225 

Leu Asp His Lys 

Met Gly Gin Arg 
260 

Asp Ala Phe Met 
275 

Pro Glu Arg Ser 
290 

Leu Trp Ser Leu 

305 



Leu Gin Gin Ala 
5 

Leu Glu Asn Leu 

Leu lie lie Leu 
40 

Glu Ser Ser Thr 
55 

Cys Asp Gin Gin 

70 

Phe Val Leu Ser 
85 

Phe Gly Pro Arg 

Ser Cys Thr Leu 
120 

Pro Leu lie Phe 
135 

Leu Thr Phe Thr 
150 

Ser Thr Leu Met 
165 

Thr Phe Pro Gly 

Val lie Met Phe 
200 

Cys Thr Leu Asn 
215 

Asn Tyr Thr Lys 
230 

Val Thr Gly Asp 
245 

Leu Ser Gin Lys 

Ser Pro Gin Asp 
280 

Val Pro Leu Arg 
295 

Leu Thr Met Gly 
310 



Tyr Arg Arg Arg 
10 

Phe Phe Ser Ala 
25 

Lys Asn Glu Gly 

Asn Thr Thr Gin 
60 

Asp Glu Met Leu 

75 

Ala Thr Thr Leu 
90 

Pro Val Arg Leu 
105 

Met Ala Leu Ala 

Leu Ala Leu Ser 
14 0 

Ser Leu Thr Leu 
155 

Ala Leu Met He 
170 

He Lys Leu He 
185 

Thr Trp Ser Gly 

Trp Pro He Glu 
220 

Lys He Lys Leu 
235 

Leu Phe Tyr Thr 
250 

Ala Pro Ser Leu 
265 

Val Arg Gly Thr 

Lys Ser Leu Cys 
300 

Met Thr Gin Leu 
315 



Trp Trp Met Ala 
15 

Val Leu Leu Gly 
30 

Phe Tyr Ser Ser 
45 

Asp Glu Gin Arg 
Asn Leu Gly Phe 

80 

Pro Leu Gly He 
95 

Val Gly Ser Ala 
110 

Ser Arg Asp Val 
125 

Leu Asn Gly Phe 

Pro Asn Met Phe 
160 

Gly Ser Tyr Ala 
175 

Tyr Asp Ala Gly 
190 

Leu Ala Cys Leu 
205 

Ala Phe Pro Ala 

Ser Gly Leu Ala 
240 

His Val Thr Thr 
255 

Glu Asp Gly Ser 
270 

Ser Glu Asn Leu 
285 

Ser Pro Thr Phe 

Arg He He Phe 
320 
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Tyr Met Ala Ala Val Asn Lys Met Leu Glu Tyr Leu Val Thr Gly Gly 

325 330 335 

Gin Glu His Glu Thr Asn Glu Gin Gin Gin Lys Val Ala Glu Thr Val 

340 345 350 

Gly Phe Tyr Ser Ser Val Phe Gly Ala Met Gin Leu Leu Cys Leu Leu 

355 360 365 

Thr Cys Pro Leu lie Gly Tyr lie Met Asp Trp Arg lie Lys Asp Cys 

370 375 380 

Val Asp Ala Pro Thr Gin Gly Thr Val Leu Gly Asp Ala Arg Asp Gly 
385 390 395 400 

Val Ala Thr Lys Ser He Arg Pro Arg Tyr Cys Lys He Gin Lys Leu 

405 410 415 

Thr Asn Ala He Ser Ala Phe Thr Leu Thr Asn Leu Leu Leu Val Gly 

420 425 430 

Phe Gly He Thr Cys Leu He Asn Asn Leu His Leu Gin Phe Val Thr 

435 440 445 

Phe Val Leu His Thr He Val Arg Gly Phe Phe His Ser Ala Cys Gly 

450 455 460 

Ser Leu Tyr Ala Ala Val Phe Pro Ser Asn His Phe Gly Thr Leu Thr 
465 470 475 480 

Gly Leu Gin Ser Leu He Ser Ala Val Phe Ala Leu Leu Gin Gin Pro 

485 490 495 

Leu Phe Met Ala Met Val Gly Pro Leu Lys Gly Glu Pro Phe Trp Val 

500 505 510 

Asn Leu Gly Leu Leu Leu Phe Ser Leu Leu Gly Phe Leu Leu Pro Ser 

515 520 525 

Tyr Leu Phe Tyr Tyr Arg Ala Arg Leu Gin Gin Glu Tyr Ala Ala Asn 

530 535 540 

Gly Met Gly Pro Leu Lys Val Leu Ser Gly Ser Glu Val Thr Ala 
545 550 555 

<210> 3 

<211> 3442 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (1760) . . . (3439) 
<400> 3 

ccggggctgg aggggggcaa gcgggttccg aggtgcaaag cctggtgccc cgagccctgc 6 0 

ggagctcggg gccagcatgg cccccacgct gcaacaggcg taccggaggc gctggtggat 120 

ggcctgcacg gctgtgctgg agaacctctt cttctctgct gtactcctgg gctggggctc 18 0 

cctgttgatc attctgaaga acgagggctt ctattccagc acgtgcccag ctgagagcag 240 

caccaacacc acccaggatg agcagcgcag gtggccaggc tgtgaccagc aggacgagat 300 

gctcaacctg ggcttcacca ttggttcctt cgtgctcagc gccaccaccc tgccactggg 360 

gatcctcatg gaccgctttg gcccccgacc cgtgcggctg gttggcagtg cctgcttcac 420 

tgcgtcctgc accctcatgg ccctggcctc ccgggacgtg gaagctctgt ctccgttgat 480 

attcctggcg ctgtccctga atggctttgg tggcatctgc ctaacgttca cttcactcac 540 

gctgcccaac atgtttggga acctgcgctc cacgttaatg gccctcatga ttggctctta 600 

cgcctcttct gccattacgt tcccaggaat caagctgatc tacgatgccg gtgtggcctt 660 

cgtggtcatc atgttcacct ggtctggcct ggcctgcctt atctttctga actgcaccct 720 

caactggccc atcgaagcct ttcctgcccc tgaggaagtc aattacacga agaagatcaa 780 

gctgagtggg ctggccctgg accacaaggt gacaggtgac ctcttctaca cccatgtgac 840 

caccatgggc cagaggctca gccagaaggc ccccagcctg gaggacggtt cggatgcctt 900 

catgtcaccc caggatgttc ggggcacctc agaaaacctt cctgagaggt ctgtcccctt 960 
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acgcaagagc ctctgctccc ccactttcct gtggagcctc ctcaccatgg gcatgaccca 1020 

gctgcggatc atcttctaca tggctgctgt gaacaagatg ctggagtacc ttgtgactgg 1080 

tggccaggag catgagacaa atgaacagca acaaaaggtg gcagagacag ttgggttcca 114 0 

ctcctccgtc ttcggggcca tgcagctgtt gtgccttctc acctgccccc tcattggcta 1200 

catcatggac tggcggatca aggactgcgt ggacgcccca actcagggca ctgtcctcgg 126 0 

agatgccagg gacggggttg ctaccaaatc catcagacca cgctactgca agatccaaaa 1320 

gctcaccaat gccatcagtg ccttcaccct gaccaacctg ctgcttgtgg gttttggcat 1390 

cacctgtctc atcaacaact tacacctcca gtttgtgacc tttgtcctgc acaccattgt 1440 

tcgaggtttc ttccactcag cctgtgggag tctctatgct gcagtgttcc catccaacca 1500 

ctttgggacg ctgacaggcc tgcagtccct catcagtgct gtgttcgcct tgcttcagca 156 0 

gccacttttc atggcgatgg tgggacccct gaaaggagag cccttctggg tgagagcgag 162 0 

ggttggtgtg gggggagcag gagccactct cctgggggca ggggtagggc ctcgtatgtg 1680 

gtgccatccc tcactcatct cagccagagg cacctcagag gtctctaatc tgcaggtttc 174 0 

caagttgtct gccttttag atg gcc ccc acg ctg caa cag gcg tac egg agg 1792 

Met Ala Pro Thr Leu Gin Gin Ala Tyr Arg Arg 
15 10 

cgc tgg tgg atg gcc tgc acg get gtg ctg gag aac etc ttc ttc tct 184 0 

Arg Trp Trp Met Ala Cys Thr Ala Val Leu Glu Asn Leu Phe Phe Ser 
15 20 25 

get gta etc ctg ggc tgg ggc tec ctg ttg ate att ctg aag aac gag 188B 
Ala Val Leu Leu Gly Trp Gly Ser Leu Leu lie lie Leu Lys Asn Glu 
30 35 40 

ggc ttc tat tec age acg tgc cca get gag age age ace aac acc acc 193 6 

Gly Phe Tyr Ser Ser Thr Cys Pro Ala Glu Ser Ser Thr Asn Thr Thr 
45 50 55 

cag gat gag cag cgc agg tgg cca ggc tgt gac cag cag gac gag atg 1984 
Gin Asp Glu Gin Arg Arg Trp Pro Gly Cys Asp Gin Gin Asp Glu Met 
60 65 70 75 

etc aac ctg ggc ttc acc att ggt tec ttc gtg etc age gcc acc acc 2032 
Leu Asn Leu Gly Phe Thr He Gly ser Phe Val Leu Ser Ala Thr Thr 
80 85 90 

ctg cca ctg ggg ate etc atg gac cgc ttt ggc ccc cga ccc gtg egg 2080 
Leu Pro Leu Gly He Leu Met Asp Arg Phe Gly Pro Arg Pro Val Arg 
95 100 105 

ctg gtt ggc agt gcc tgc ttc act gcg tec tgc acc etc atg gcc ctg 2128 
Leu Val Gly Ser Ala Cys Phe Thr Ala Ser Cys Thr Leu Met Ala Leu 
no X15 120 

gcc tec egg gac gtg gaa get ctg tct ccg ttg ata ttc ctg gcg ctg 2176 
Ala Ser Arg Asp Val Glu Ala Leu Ser Pro Leu He Phe Leu Ala Leu 
125 130 135 

tec ctg aat ggc ttt ggt ggc ate tgc eta acg ttc act tea etc acg 2224 
Ser Leu Asn Gly Phe Gly Gly He Cys Leu Thr Phe Thr Ser Leu Thr 
3-4° 145 150 155 

ctg ccc aac atg ttt ggg aac ctg cgc tec acg tta atg gcc etc atg 2272 
Leu Pro Asn Met Phe Gly Asn Leu Arg Ser Thr Leu Met Ala Leu Met 
160 165 170 
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att ggc tct tac gcc tct tct gcc att acg ttc cca gga ate aag ctg 232 0 

lie Gly Ser Tyr Ala Ser Ser Ala lie Thr Phe Pro Gly He Lys Leu 

175 180 185 

ate tac gat gcc ggt gtg gcc ttc gtg gtc ate atg ttc acc tgg tct 2368 

lie Tyr Asp Ala Gly Val Ala Phe Val Val He Met Phe Thr Trp Ser 

190 195 200 

ggc ctg gcc tgc ctt ate ttt ctg aac tgc acc etc aac tgg ccc ate 2416 

Gly Leu Ala Cys Leu He Phe Leu Asn Cys Thr Leu Asn Trp Pro He 

205 210 215 

gaa gcc ttt cct gcc cct gag gaa gtc aat tac acg aag aag ate aag 2464 

Glu Ala Phe Pro Ala Pro Glu Glu Val Asn Tyr Thr Lys Lys He Lys 

220 225 230 235 

ctg agt ggg ctg gcc ctg gac cac aag gtg aca ggt gac etc ttc tac 2512 

Leu Ser Gly Leu Ala Leu Asp His Lys Val Thr Gly Asp Leu Phe Tyr 

240 245 250 

acc cat gtg acc acc atg ggc cag agg etc age cag aag gcc ccc age 256 0 

Thr His Val Thr Thr Met Gly Gin Arg Leu Ser Gin Lys Ala Pro Ser 

255 260 265 

ctg gag gac ggt teg gat gcc ttc atg tea ccc cag gat gtt egg ggc 26 0 8 

Leu Glu Asp Gly Ser Asp Ala Phe Met Ser Pro Gin Asp Val Arg Gly 

270 275 280 

acc tea gaa aac ctt cct gag agg tct gtc ccc tta cgc aag age etc 2656 

Thr Ser Glu Asn Leu Pro Glu Arg Ser Val Pro Leu Arg Lys Ser Leu 

285 290 295 

tgc tec ccc act ttc ctg tgg age etc etc acc atg ggc atg acc cag 2704 

Cys Ser Pro Thr Phe Leu Trp Ser Leu Leu Thr Met Gly Met Thr Gin 

300 305 310 315 

ctg egg ate ate ttc tac atg get get gtg aac aag atg ctg gag tac 2752 

Leu Arg He He Phe Tyr Met Ala Ala Val Asn Lys Met Leu Glu Tyr 

320 325 330 

ctt gtg act ggt ggc cag gag cat gag aca aat gaa cag caa caa aag 280 0 

Leu Val Thr Gly Gly Gin Glu His Glu Thr Asn Glu Gin Gin Gin Lys 

335 340 345 

gtg gca gag aca gtt ggg ttc tac tec tec gtc ttc ggg gcc atg cag 2848 

Val Ala Glu Thr Val Gly Phe Tyr Ser Ser Val Phe Gly Ala Met Gin 

350 355 360 

ctg ttg tgc ctt etc acc tgc ccc etc att ggc tac ate atg gac tgg 2896 

Leu Leu Cys Leu Leu Thr Cys Pro Leu He Gly Tyr He Met Asp Trp 

365 370 375 

c <39 ate aag gac tgc gtg gac gcc cca act cag ggc act gtc etc gga 2 944 

Arg He Lys Asp Cys Val Asp Ala Pro Thr Gin Gly Thr Val Leu Gly 

380 385 390 395 

gat gcc agg gac ggg gtt get acc aaa tec ate aga cca cgc tac tgc 2992 
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Asp Ala Arg Asp Gly Val Ala Thr Lys Ser lie Arg Pro Arg Tyr Cys 

400 405 410 

aag ate caa aag etc acc aat gec ate agt gec ttc acc ctg acc aac 3040 

Lys lie Gin Lys Leu Thr Asn Ala He Ser Ala Phe Thr Leu Thr Asn 

415 420 4?5 

ctg ctg ctt gtg ggt ttt ggc ate acc tgt etc ate aac aac tta cac 3 088 

Leu Leu Leu Val Gly Phe Gly He Thr Cys Leu He Asn Asn Leu His 
430 435 440 

etc cag ttt gtg acc ttt gtc ctg cac acc att gtt cga ggt ttc ttc 3136 

Leu Gin Phe Val Thr Phe Val Leu His Thr He Val Arg Gly Phe Phe 
445 450 455 

cac tea gec tgt ggg agt etc tat get gca gtg ttc cca tec aac cac 3184 

His Ser Ala Cys Gly Ser Leu Tyr Ala Ala Val Phe Pro Ser Asn His 
460 465 470 475 

ttt ggg acg ctg aca ggc ctg cag tec etc ate agt get gtg ttc gee 3 232 

Phe Gly Thr Leu Thr Gly Leu Gin Ser Leu lie Ser Ala Val Phe Ala 
480 485 490 

ttg ctt cag cag cca ctt ttc atg gcg atg gtg gga ccc ctg aaa gga 32 8 0 

Leu Leu Gin Gin Pro Leu Phe Met Ala Met Val Gly Pro Leu Lys Gly 

495 500 505 

gag ccc ttc tgg gtg aga gcg agg gtt ggt gtg ggg gga gca gga gec 3328 

Glu Pro Phe Trp Val Arg Ala Arg Val Gly Val Gly Gly Ala Gly Ala 
510 515 520 

act etc ctg ggg gca ggg gta ggg cct tgt atg tgg tgc cat ccc tea 3 3 76 

Thr Leu Leu Gly Ala Gly Val Gly Pro Cys Met Trp Cys His Pro Ser 
525 530 535 

etc ate tea gec aga ggc acc tea gag gtc tct aat ctg cag gtt tec 3424 

Leu He Ser Ala Arg Gly Thr Ser Glu Val Ser Asn Leu Gin Val Ser 
540 545 550 555 

aag ttg tct gee ttt tag 3442 
Lys Leu Ser Ala Phe 
560 



<210> 4 

<211> 560 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Ala Pro Thr Leu Gin Gin Ala 

1 5 
Cys Thr Ala Val Leu Glu Asn Leu 
20 

Trp Gly Ser Leu Leu He He Leu 

35 40 
Thr Cys Pro Ala Glu Ser Ser Thr 



Tyr Arg Arg Arg Trp Trp Met Ala 

10 15 
Phe Phe Ser Ala Val Leu Leu Gly 
25 30 
Lys Asn Glu Gly Phe Tyr Ser Ser 
45 

Asn Thr Thr Gin Asp Glu Gin Arg 
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50 55 60 

Arg Trp Pro Gly Cys Asp Gin Gin Asp Glu Met Leu Asn Leu Gly Phe 
65 70 75 80 

Thr lie Gly Ser Phe Val Leu Ser Ala Thr Thr Leu Pro Leu Gly lie 

85 90 95 

Leu Met Asp Arg Phe Gly Pro Arg Pro Val Arg Leu Val Gly Ser Ala 

100 105 110 

Cys Phe Thr Ala Ser Cys Thr Leu Met Ala Leu Ala Ser Arg Asp Val 

115 120 125 

Glu Ala Leu Ser Pro Leu lie Phe Leu Ala Leu Ser Leu Asn Gly Phe 

130 135 140 

Gly Gly lie Cys Leu Thr Phe Thr Ser Leu Thr Leu Pro Asn Met Phe 
145 150 155 160 

Gly Asn Leu Arg Ser Thr Leu Met Ala Leu Met lie Gly Ser Tyr Ala 

165 170 175 

Ser Ser Ala lie Thr Phe Pro Gly lie Lys Leu He Tyr Asp Ala Gly 

180 185 190 

Val Ala Phe Val Val He Met Phe Thr Trp Ser Gly Leu Ala Cys Leu 

195 200 205 

He Phe Leu Asn Cys Thr Leu Asn Trp Pro He Glu Ala Phe Pro Ala 

210 215 220 

Pro Glu Glu Val Asn Tyr Thr Lys Lys He Lys Leu Ser Gly Leu Ala 
225 230 235 240 

Leu Asp His Lys Val Thr Gly Asp Leu Phe Tyr Thr His Val Thr Thr 

245 ' 250 255 

Met Gly Gin Arg Leu Ser Gin Lys Ala Pro Ser Leu Glu Asp Gly Ser 

260 265 270 

Asp Ala Phe Met Ser Pro Gin Asp Val Arg Gly Thr Ser Glu Asn Leu 

275 280 285 

Pro Glu Arg Ser Val Pro Leu Arg Lys Ser Leu Cys Ser Pro Thr Phe 

290 295 300 

Leu Trp Ser Leu Leu Thr Met Gly Met Thr Gin Leu Arg He He Phe 
305 310 315 320 

Tyr Met Ala Ala Val Asn Lys Met Leu Glu Tyr Leu Val Thr Gly Gly 

325 330 335 

Gin Glu His Glu Thr Asn Glu Gin Gin Gin Lys Val Ala Glu Thr Val 

340 345 350 

Gly Phe Tyr Ser Ser Val phe Gly Ala Met Gin Leu Leu Cys Leu Leu 

355 360 365 

Thr Cys Pro Leu He Gly Tyr He Met Asp Trp Arg He Lys Asp Cys 

370 375 380 

Val Asp Ala Pro Thr Gin Gly Thr Val Leu Gly Asp Ala Arg Asp Gly 
385 390 395 400 

Val Ala Thr Lys Ser He Arg Pro Arg Tyr Cys Lys He Gin Lys Leu 

405 410 415 

Thr Asn Ala He Ser Ala Phe Thr Leu Thr Asn Leu Leu Leu Val Gly 

420 425 430 

Phe Gly He Thr Cys Leu He Asn Asn Leu His Leu Gin Phe Val Thr 

435 440 445 

Phe Val Leu His Thr He Val Arg Gly Phe Phe His Ser Ala Cys Gly 

450 455 460 

Ser Leu Tyr Ala Ala Val Phe Pro Ser Asn His Phe Gly Thr Leu Thr 
465 470 475 480 

Gly Leu Gin Ser Leu He Ser Ala Val Phe Ala Leu Leu Gin Gin Pro 

485 490 495 

Leu Phe Met Ala Met Val Gly Pro Leu Lys Gly Glu Pro Phe Trp Val 
500 505 510 
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Arg Ala Arg Val Gly Val 

515 

Gly val Gly Pro cys Met 
530 

Gly Thr Ser Glu Val Ser 
545 550 



Gly Gly Ala Gly Ala Thr 
520 

Trp Cys His Pro Ser Leu 

535 540 
Asn Leu Gin Val Ser Lys 
555 
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Leu Leu Gly Ala 
525 

lie Ser Ala Arg 

Leu Ser Ala Phe 
560 



<210> 5 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> Arbitrary primer A2 from Stratagene , Inc. 
<400> 5 

aatctagagc tccagcag 18 



<210> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Zinc finger- directed primer 

<400> 6 

gtcgtcgaat tccacacagg agaaaagcc 2 9 



<210> 7 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 7 

gcatgttaca ggtagaaaag cc 22 

<210> 8 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 8 

ctggcgtatc tgaagagtct g 21 

<210> 9 

<211> 103 

<212> DNA 

<213> Homo sapiens 



<400> 9 

acaggaatcc ccaggagtga agaataagca ggaggcccca gattcacctt tagggcaagg 6 0 

agagagaaac agagtcaagt aggtagtcat ctgcccttaa gcc 103 

<210> 10 
<211> 20 
<212> DNA 
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<213> Homo sapiens 

<400> 10 
gaccgcatag acttctcaga 

<210> 11 

<21l> 22 

<212> DNA 

<213> Homo sapiens 

<400> 11 
tctgcaaagt ggctgagatg ag 

<210> 12 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 12 
cctgccttat ctttctgaac tgcacc 

<210> 13 

<211> 14 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope 

<400> 13 

Thr Gin Asp Glu Gin Arg Arg Trp Pro 
1 5 

<210> 14 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope 
<400> 14 

Glu Asn Leu Pro Glu Arg Ser Val Pro 
1 5 

<210> 15 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope 
<400> 15 

Arg Pro Arg Tyr Cys Lys lie Gin Lys 
1 5 
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20 



22 



26 



Gly Cys Asp Gin Gin 
10 



Leu Arg Lys Ser Leu 
10 



Leu Thr Asn Ala 
10 
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<210> 16 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Epitope 
<400> 16 

Ala Asn Gly Met Gly Pro Leu Lys Val Leu Ser Gly Ser 
15 10 

<210> 17 
<211> 13 
<212> P£T 

<213> Artificial Sequence 
<220> 

<223> Epitope 
<400> 17 

Ala Arg Gly Thr Ser Glu Val Ser Asn Leu Gin Val Ser 

15 10 
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